Back

American Journal of Epidemiology

54 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Moving beyond risk ratios in sibling analysis: estimating clinically useful measures from family-based analysis
2025-05-16 epidemiology 10.1101/2025.05.16.25327702
#1 (12.1%)
Show abstract

ObjectiveFindings from family-based analyses, such as sibling comparisons, are often reported using only odds ratios or hazard ratios. We demonstrate how this can be improved upon by applying the marginalized between-within framework. Study Design and SettingWe provide an overview of sibling comparison methods and the marginalized between-within framework, which enables estimation of absolute risks and clinically relevant metrics while accounting for shared familial confounding. We illustrate t...

2
Unmeasured confounding and misclassification in vaccine effectiveness studies using electronic health records (EHRs): an evaluation of a multi country European study (VEBIS-EHR)
2025-05-01 epidemiology 10.1101/2025.04.30.25326722
#1 (11.3%)
Show abstract

BackgroundElectronic health record (EHR)-based observational studies can rapidly provide real-world data on vaccine effectiveness (VE), particularly key during the COVID-19 pandemic. However, EHR data may be prone to misclassification and unmeasured confounding, requiring systematic mitigation to ensure robust findings. MethodsIn VEBIS-EHR, a retrospective multi-country COVID-19 VE cohort study, we examined unmeasured confounding using a negative control outcome (death not related to COVID-19) ...

3
Bias correction methods for test-negative designs in the presence of misclassification
2019-07-20 epidemiology 10.1101/19002691
#1 (11.2%)
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWThe test-negative design has become a standard approach for vaccine effectiveness studies. However, previous studies suggested that it may be more sensitive than other designs to misclassification of disease outcome caused by imperfect diagnostic tests. This could be a particular limitation in vaccine effectiveness studies where simple tests (e.g. rapid influenza diagnostic tests) are used for logistical convenience. To address this issue, we derived a mathematical repres...

4
Cluster-weighted modified Poisson regression for estimating risk ratios in longitudinal data with informative cluster sizes
2025-05-25 epidemiology 10.1101/2025.05.23.25328253
#1 (11.1%)
Show abstract

Variation in binary outcomes over time by cluster size arises across various biomedical disciplines, including reproductive health, dental medicine, and psychiatric epidemiology. This study formally integrates modified Poisson regression with cluster-weighted generalized estimating equations (MP-CWGEE) for computing risk ratios in longitudinal studies with informative cluster sizes. Using a comprehensive Monte-Carlo simulation study, we empirically evaluated MP-CWGEEs statistical properties agai...

5
Initial Description and Demonstration of a Family of Methods with the Potential to Approximate Confounding
2025-11-02 epidemiology 10.1101/2025.10.30.25339088
#1 (11.0%)
Show abstract

Confounding is one of the most important concerns for randomized or nonrandomized intervention or exposure studies. This manuscript describes several metrics intended to provide quantitative approximations of confounding under certain conditions. Each metric quantifies differences in risk between intervention arms during time periods when the intervention (or exposure of interest) is not occurring. Because exposure is absent, these metrics have the potential to summarize the effects of other mea...

6
Revisiting Bias in Odds Ratios
2021-03-02 epidemiology 10.1101/2021.02.28.21252604
#1 (10.4%)
Show abstract

Ratio measures of effect, such as the odds ratio (OR), are consistent, but the presumption of their unbiasedness is founded on a false premise: The equality of the expected value of a ratio and the ratio of expected values. We show that the invalidity of this assumptions is an important source of empirical bias in ratio measures of effect, which is due to properties of the expectation of ratios of count random variables. We investigate ORs (unconfounded, no effect modification), proposing a corr...

7
qbaconfound: A flexible Monte Carlo probabilistic bias analysis for unmeasured confounding
2025-08-14 epidemiology 10.1101/2025.08.12.25333217
#1 (9.1%)
Show abstract

BackgroundUnmeasured confounding is a persistent concern in observational studies. We can quantitatively assess the impact of unmeasured confounding using a quantitative bias analysis (QBA). A QBA specifies the relationship between the unmeasured confounder(s), U, and study data via its bias parameters. There are two broad classes of QBA methods: deterministic and probabilistic. We focus on a probabilistic QBA which incorporates external information about U via prior distribution(s) placed on t...

8
Comparative performance of the concurrent comparator design with existing vaccine safety surveillance approaches on real-world observational health data
2026-01-26 public and global health 10.64898/2026.01.25.26344812
#1 (8.7%)
Show abstract

BackgroundIt is critical public health concern to identify safety signals originating from wide-scale immunization efforts. Such safety signals may be identified from spontaneous reports and other data sources. Although some work has been done on the best methods for vaccine safety surveillance, there is a scarcity of information on how these perform in analyses of real-world data. MethodsWe use four administrative claims databases and one electronic health record (EHR) database to evaluate the...

9
Relative risks, the probability of necessity, and attributable fractions
2024-07-04 epidemiology 10.1101/2024.07.03.24309898
#1 (8.5%)
Show abstract

Epidemiologists are careful to describe their findings as "associations", and to avoid any causal language or claims. Arguably, this attempt to avoid reference to causal processes has become counterproductive. Explicitly stated or not, assumptions about causal processes are inherent in the formulation and interpretation of any statistical study. This article offers a bridge between established, extensively developed proportional hazard methods that are used to study longitudinal observational co...

10
Multiple imputation of missing data under missing at random: including a collider as an auxiliary variable in the imputation model can induce bias
2023-06-18 epidemiology 10.1101/2023.06.16.23291497
#1 (8.4%)
Show abstract

Epidemiological studies often have missing data, which are commonly handled by multiple imputation (MI). In MI, in addition to those required for the substantive analysis, imputation models often include other variables ("auxiliary variables"). Auxiliary variables that predict the partially observed variables can reduce the standard error (SE) of the MI estimator and, if they also predict the probability that data are missing, reduce bias due to data being missing not at random. However, guidanc...

11
Accounting for external factors and early intervention adoption in the design and analysis of stepped-wedge designs: Application to a proposed study design to reduce opioid-related mortality
2020-07-29 epidemiology 10.1101/2020.07.26.20162297
#1 (7.7%)
Show abstract

BackgroundStepped-wedge designs (SWDs) are currently being used to investigate interventions to reduce opioid overdose deaths in communities located in several states. However, these interventions are competing with external factors such as newly initiated public policies limiting opioid prescriptions, media awareness campaigns, and social distancing orders due to the COVID-19 pandemic. Furthermore, control communities may prematurely adopt components of the proposed intervention as they become ...

12
Best of intent, worst of both worlds: why sequentially combining epidemiological designs does not improve signal detection in vaccine safety surveillance
2022-02-22 epidemiology 10.1101/2022.02.18.22271183
#1 (7.6%)
Show abstract

BackgroundVaccine safety surveillance commonly includes a serial testing approach with a sensitive method for "signal generation" and specific method for "signal validation." Whether serially combining epidemiological designs improves both sensitivity and specificity is unknown. MethodsWe assessed the overall performance of serial testing using three administrative claims and one electronic health record database. We compared Type I and II errors before and after empirical calibration for histo...

13
Application of the Adaptive Validation Design to estimate the association between transmasculine/transfeminine status and self-inflicted injury among transgender and gender-nonconforming children and adolescents.
2020-02-23 epidemiology 10.1101/2020.02.20.20024182
#1 (7.5%)
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWO_ST_ABSBackgroundC_ST_ABSAn internal validation substudy compares an imperfect measurement of a variable with a gold standard measurement in a subset of the study population. Validation data permit calculation of a bias-adjusted estimate, expected to equal the association that would have been observed had the gold standard measurement been available for the entire study population. Guidance on optimal sampling of participants to include in validation substudies has not c...

14
Assessing the impact of limit of detection on HAI titer increase estimation in vaccine studies
2022-08-30 epidemiology 10.1101/2022.08.25.22279230
#1 (7.4%)
Show abstract

In many laboratory assays that measure immunological quantities, a portion of the measured values fall below a limit of detection (LOD). This is also the case for the hemagglutination inhibition assay (HAI), a common method used to quantify antibodies in influenza research. The conventional approach is to treat values below the LOD as either equal to the LOD or LOD/2, which can introduce potential biases. These biases can become more pronounced when calculating compound measures such as the diff...

15
Association of cognitive impairment with adverse childhood experiences: 2019 Results from 21 states
2025-11-06 epidemiology 10.1101/2025.11.05.25339606
#1 (7.3%)
Show abstract

Our objective was to assess associations between 8 adverse childhood experiences (ACEs; physical, emotional, and sexual abuse as a child, or living in household with substance abuse, mental health problems, divorce, intimate partner violence, or incarceration) and subjective cognitive impairment (SCI: the disability question from the 2019 Behavioral Risk Factor Surveillance Surveys) and compare with cardiovascular disease (CVD) results. MethodsAdults (N=149,801) from 21 states with data on ACEs...

16
Considerations for using multiple imputation in propensity score-weighted analysis
2021-07-05 epidemiology 10.1101/2021.06.30.21259793
#1 (7.3%)
Show abstract

We present our considerations for using multiple imputation to account for missing data in propensity score-weighted analysis with bootstrap percentile confidence interval. We outline the assumptions underlying each of the methods and discuss the methodological and practical implications of our choices and briefly point to alternatives. We made a number of choices a priori for example to use logistic regression-based propensity scores to produce "standardized mortality ratio"-weights and Substan...

17
Early-life socioeconomic position and the accumulation of health-related deficits by midlife in the 1958 British birth cohort study
2020-09-15 epidemiology 10.1101/2020.09.14.20193961
#1 (7.2%)
Show abstract

Reducing population levels of frailty is an important goal and preventing its development in mid-adulthood could be pivotal. Childhood socioeconomic position (SEP) is associated with a myriad of adult health outcomes but evidence is limited on associations with frailty. Using 1958 British birth cohort data (N=8711), we aimed to: (i) establish the utility of measuring frailty in mid-life, by examining associations between a 34-item frailty index at 50y (FI50y) and mortality over an eight-year fol...

18
Multiple imputation assuming missing at random: auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random
2023-10-17 epidemiology 10.1101/2023.10.17.23297137
#1 (7.2%)
Show abstract

Epidemiological studies often have missing data, which are commonly handled by multiple imputation (MI). MI is valid (given correctly-specified models) if data are missing at random, conditional on the observed data, but not (unless additional information is available) if data are missing not at random (MNAR). In this paper we explore a previously-suggested strategy, namely, including an auxiliary variable predictive of missingness but not the missing data in the imputation model, when data are ...

19
Life-course neighbourhood deprivation and brain structure in older adults: The Lothian Birth Cohort 1936
2023-04-17 epidemiology 10.1101/2023.04.13.23288523
#1 (7.1%)
Show abstract

Neighbourhood disadvantage may be associated with brain health but the importance at different stages of the life course is poorly understood. Utilizing the Lothian Birth Cohort 1936, we explored the relationship between residential neighbourhood deprivation from birth to late adulthood, and global and regional neuroimaging measures at age 73. We found that residing in disadvantaged neighbourhoods in mid- to late adulthood was associated with smaller total brain ({beta}=-0.06; SE=0.02; n=390) an...

20
Causal Inference via Electronic Health Records in the National Clinical Cohort Collaborative: Challenges and Solutions in Long COVID Research
2025-06-08 epidemiology 10.1101/2025.06.06.25329168
#1 (7.0%)
Show abstract

Observational analyses of electronic health record (EHR) data using databases such as the National Clinical Cohort Collaborative include unique challenges for researchers seeking causal inferences, particularly when evaluating subjectively-defined outcomes like Long COVID. We explore several challenges and describe potential solutions. 1. Lack of true negatives: Many diagnoses and conditions either have a positive indicator or a missing status, requiring investigators to carefully consider which...